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DETERMINE A COMBINED CHANNEL RESPONSE MATRIX, A, 
USING THE ESTIMATED RESPONSES, {j w , AND THE 
SPREADING CODES, c N , FOR EACH BURST 



FORM THE COMBINED CHANNEL CORRELATION MATRIX R « A H A 



A K ST K MATRIX, A w . IS DETERMINED BY TAKING AN 
FFT OF BLOCK ENTRIES OF A BLOCK COLUMN Of R, 
PRffERRABLY. A CENTRAL COLUMN 



A HERMET1AN OF THE COMBINED CHANNEL RESPONSE 
MATRIX. A " . IS MULTIPLIED TO THE RECEIVED VECTOR, 
I . AND A FFT OF A RESULT OF THE MULWUCATTON IS 

TAKEN AT EACH FREQUENCY PONT; ALTERNATELY. 
|F ( **' ) K CAN BE FORMED BY TAKING AN FFT OF A 
BLOCK COLUMN OF A AND MULTIPlYMO IT WTH 
A* FFT Of i. 



(57) Abstract: K data signals are transmitted over a shared spectrum in a 
code division access communication system. A combined signal is received 
and sampled over the shared spectrum (48). The combined signal has the 
K transmitted data signals (48). A combined channel response matrix is 
produced (49) using the codes and impulse responses of the K transmitted 
data signals. A block column of a combined channel correlation matrix is 
determined using the combined channel response matrix (50). Each block 
entry orthe block column is a K by K matrix (51). At each frequency point 
k, a K by K matrix is determined by taking the Fourier transform of the 
block entries of the block column (52). An inverse of a K by K matrix is 
multiplied to a result of the Fourier transform (52). Alternately, forward 
substitution (53) and backward substitution (54) can be used to solve the 
system. An inverse I^ouricr transform is used to recover the data from the 
Kdala signals (55). 



NO 

©\ 



O 



52 



53 



perform lu oc compost** on i". a* • iu. wca 

I ft A U*CX TOANOUUft MATRn IND U B AM UPPW 

vmwj* ma Twi a 



I 



rt 



1 _ 



MfWftQM W ML* 

<riiik.u'|rtiif.«i 



55 





jtx m ariruin «t» 






Ml DATA VECTOR 





WO 02/089346 Al IlillilillllillMlil 



GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent For two-letter codes and other abbreviations, refer to the "Guid- 

(BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, ance Notes on Codes and Abbreviations'* appearing at the begin- 

NE, SN, TD, TG). ning of each regular issue of the PCT Gazette. 

Published: 

— with international search report 




WO 02/089346 



PCT/US02/13266 



[0001] 



FAST JOINT DETECTION 



[0002] This application claims priority to U.S. Provisional Patent Application No. 
60/287,431, filed on April 30, 2001. 



[0004] The invention generally relates to wireless communication systems. In 
particular, the invention relates to data detection in a wireless communication system. 
[0005] Figure 1 is an illustration of a wireless communication system 10. The 
communication system 10 has base stations 12! to 12 5 (12) which communicate with 
user equipments (UEs) 14, to 14 3 (14). Each base station 12 has an associated 
operational area, where it communicates with UEs 14 in its operational area. 
[0006] In some communication systems, such as frequency division duplex using 
code division multiple access (FDD/CDMA) and time division duplex using code 
division multiple access (TDD/CDMA), multiple communications are sent over the 
same frequency spectrum. These communications are differentiated by their 
channelization codes. To more efficiently use the frequency spectrum, TDD/CDMA 
communication systems use repeating frames divided into timeslots for 
communication. A communication sent in such a system will have one or multiple 
associated codes and timeslots assigned to it. 

[0007] Since multiple communications may be sent in the same frequency spectrum 
and at the same time, a receiver in such a system must distinguish between the multiple 
communications. One approach to detecting such signals is multiuser detection 
(MUD). In MUD, signals associated with all the UEs 14, users, are detected 
simultaneously. Another approach to detecting a multi-code transmission from a single 
transmitter is single user detection (SUD). In SUD, to recover data from the multi- 
code transmission at the receiver, the received signal is passed through an equalization 
stage and despread using one or multiple codes. Approaches for implementing MUD 
and the equalization stage of SUD include using a Cholesky or an approximate 
Cholcsky decomposition. These approaches have a high complexity. The high 
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complexity leads to increased power consumption, which at the UE 14 results in 
reduced battery life. Accordingly, it is desirable to have alternate approaches to 
detecting received data. 

[0008] SUMMARY 

[0009] K data signals are transmitted over a shared spectrum in a code division 
multiple access communication system. A combined signal is received and sampled 
over the shared spectrum. The combined signal has the K transmitted data signals. A 
combined channel response matrix is produced using the codes and impulse responses 
of the K transmitted data signals. A block column of a combined channel correlation 
matrix is determined using the combined channel response matrix. Each block entry 
of the block column is a K by K matrix. At each frequency point k, a K by K matrix 
A 00 is determined by taking the fourier transform of the block entries of the block 
column. An inverse of A 00 is multiplied to a result of the fourier transform. 
Alternately, forward and backward substitution can be used to solve the system. An 
inverse fourier transform is used to recover the data from the K data signals. 

[0010] BRIEF DESCRIPTION OF THE DRAWING(S) 

[001 1] Figure 1 is a wireless communication system. 

[0012] Figure 2 is a simplified transmitter and a fast joint detection receiver. 

[0013] Figure 3 is an illustration of a communication burst. 

[0014] Figure 4 is a flow chart of a preferred embodiment for fast joint detection. 

[0015] Figure 5 is an illustration of a data burst indicating extended processing 

areas. 

[0016] Figures 6-1 1 are graphs illustrating the simulated performance of fast joint 
detection to other data detection approaches. 




[0017] DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT(S) 
[0018] Figure 2 illustrates a simplified transmitter 26 and receiver 28 using fast 
joint detection in a TDD/CDMA communication system, although fast joint detection 
is applicable to other systems, such as FDD/CDMA. In a typical system, a transmitter 
26 is in each UE 14 and multiple transmitting circuits 26 sending multiple 
communications are in each base station 12. The joint detection receiver 28 may be 
at a base station 12, UEs 14 or both. 

[0019] The transmitter 26 sends data over a wireless radio channel 30. A data 
generator 32 in the transmitter 26 generates data to be communicated to the receiver 
28. A modulation/spreading/training sequence insertion device 34 spreads the data 
with the appropriate code(s) and makes the spread reference data time-multiplexed 
with a midamble training sequence in the appropriate assigned time slot, producing a 
communication burst or bursts. 

[0020] A typical communication burst 1 6 has a midamble 20, a guard period 1 8 and 
two data fields 22, 24, as shown in Figure 3. The midamble 20 separates the two data 
fields 22, 24 and the guard period 1 8 separates the communication bursts to allow for 
the difference in arrival times of bursts transmitted from different transmitters 26. The 
two data fields 22, 24 contain the communication burst's data. 
[0021] The communication burst(s) are modulated by a modulator 36 to radio 
frequency (RF). An antenna 38 radiates the RF signal through the wireless radio 
channel 30 to an antenna 40 of the receiver 28. The type of modulation used for the 
transmitted communication can be any of those known to those skilled in the art, such 
as quadrature phase shift keying (QPSK) or M-ary quadrature amplitude modulation 
(QAM). 

[0022] The antenna 40 of the receiver 28 receives various radio frequency signals. 
The received signals are demodulated by a demodulator 42 to produce a baseband 
signal. The baseband signal is sampled by a sampling device 43, such as one or 
multiple analog to digital converters, at the chip rate or a multiple of the chip rate of 
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the transmitted bursts. The samples are processed, such as by a channel estimation 
device 44 and a fast joint detection device 46, in the time slot and with the appropriate 
codes assigned to the received bursts. The channel estimation device 44 uses the 
midamble training sequence component in the baseband samples to provide channel 
information, such as channel impulse responses. The channel impulse responses for 
all the transmitted signals can be viewed as a matrix, H. The channel information is 
used by the fast joint detection device 46 to estimate the transmitted data of the 
received communication bursts as soft symbols. 

[0023] The fast joint detection device 46 uses the channel information provided by 
the channel estimation device 44 and the known spreading codes used by the 
transmitter 26 to estimate the data of the desired received communication burst(s). 
[0024] Although fast joint detection is explained using the third generation 
partnership project (3 GPP) universal terrestrial radio access (UTRA) TDD system as 
the underlying communication system, it is applicable to other systems. That system 
is a direct sequence wideband CDMA (W-CDMA) system, where the uplink and 
downlink transmissions are confined to mutually exclusive timeslots. 
[0025] The receiver 28 receives a total of K bursts that arrive simultaneously. The 
K bursts are superimposed on top of each other in one observation interval. For the 
3 GPP UTRA TDD system, each data field of a time slot corresponds to one 
observation interval. A code used for a k* burst is represented as C 00 . The K bursts 
may originate from K different transmitters or for multi-code transmissions, less than 
K different transmitters. 

[0026] Each data field of a communication burst has a predetermined number of 
transmitted symbols, N s . Each symbol is transmitted using a predetermined number 
of chips, which is the spreading factor, SF. Accordingly, each data field has N s x SF 
chips. After passing through the wireless radio channel, each symbol has an impulse 
response, such as of length W chips. A typical value for W is 57. As a result, each 
received field has a length of SF xN s + W-l chips or N c chips. 




[0027] Each k m field of the K data fields in one observation interval can be modeled 
at the receiver by Equation 1 . 

£ M = A {k) d {k \k = 1—K Equation 1 

[0028] rW is the received contribution of the k* field. A 00 is the combined channel 

response for the field. A 00 is a N c x N s matrix. Each j* column in A (k) is a zero- 
padded version of the symbol response S (k) of the j* element of d (k) . The symbol 

response S°° is the convolution of the k* field ! s estimate response, , and spreading 

code C (k) for the field. d (k) is the unknown data symbols in the k* data field, is 
of length W chips and can be represented by Equation 2. 

£(*) =r (*).£(*) Equation 2 



reflects the transmitter gain and path loss. 1%^ is the channel impulse response. 
[0029] For uplink communications, each ffi k ^ as well as each Y^ are distinct. For 



the downlink, all of the fields have the same h but each j is different. If transmit 

diversity is used in the downlink, each / k) and fl * are distinct 

[0030] The overall received vector r from all K fields sent over the wireless 
channel is per Equation 3. 



* Equation 3 

L= Z L ( ) + n 

k=\ 



n is a zero-mean noise vector. 

[003 1] By combining the A 00 for all data fields into a total channel response matrix 
A and all the unknown data for each burst d w into a total data vector d, Equation 1 
becomes Equation 4. 

r = Ad + n Equation 4 

[0032] Determining d using a MMSE solution is per Equation 5. 

d = BT x {A H r^ Equations 

represents the hermetian function (complex conjugate transpose). R for a MMSE 
solution is per Equation 6. 

R = A H A + a 1 1 Equation 6 

ct 2 is the noise variance, typically obtained from the channel estimation device 44, and 
I is the identity matrix. 

[0033] Using fast fourier transforms (FFTs), although other fourier transforms may 
be used, this equation is solved, preferably, per Equation 7. 
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indicates the FFT function. The indicate that the equation is solved at each 

frequency point k. are block entries of size K by K of a block diagonal matrix 
A . The derivation of A is described subsequently. Instead of directly solving 
Equation 7, Equation 7 can be solved using forward and backward substitution. 
[0034] Figure 4 is a flow chart for a preferred method of determining the data 
vector d using fast joint detection. The combined channel response matrix A is 

determined using the estimated responses and the spreading code for each 

burst , 48. Form the combined channel correlation matrix, R - A H A , 49. At each 
frequency point, a K by K matrix A^ is determined by taking the fourier transform of 
block entries of a block column of R (block FFT), 50. Preferably, a central column is 
used, at least w columns from the left or right of the R matrix. 

[0035] ^p" r ] A . is determined using a FFT ofa matrix multiplication, 51. The 
inverse of each A (/r) , [a^J 1 , is determined. To determine [i 7 ^)]^ [a^] 1 and 
jF^^r)^ are multiplied at each frequency point Alternately, [^(s!)]^ is 

determined using LU decomposition. A**' is decomposed into a lower triangular 
matrix, L, and an upper triangular matrix, U, 52. Using forward, Ly = jF^"r)j , 53, 
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determined by an inverse FFT of F(d) , 55. 

[0036] The derivation of Equation 7 is as follows. A minimum mean square error 
solution of Equation 4 is determined per Equation 8. Although Equation 7 is a MMSE 
based solution, fast joint detection can be implemented using other approaches, such 
as a zero forcing approach. 



If a zero forcing solution is used, the a 2 1 term drops out of Equation 8, such as 
Rd = {A H A}d = A H r. The following is a derivation for the MMSE solution, although 

an analogous derivation can be used for a zero forcing solution. For illustrative 
purposes, a simplified example of R with N s = 10 and W = 2 is per Equation 9. This 
example is readily extendable to any N s and W. 
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Equation 9 

[0037] The matrix R is of size (KN S ) by (KN S ) in general. Each entry, % , in the R 

matrix is a K by K block. The sub-matrix, within the dotted lines of R, is block- 
circulant, i.e. a block- wise extension of a circulant matrix. The portion of R, which 
is not block-circulant, depends on the maximum multipath delay spread, W. 
[0038] A block-circulant extension of the matrix R, Rq, in Equation 9 is per 
Equation 10. 
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Equation 10 

[0039] A "Discrete Fourier Transform (DFT)-like" matrix D is determined such 
that^=£>AD" One such matrix D is per Equation 1 1 . 



D = 
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J K is a K by K identity matrix. 

[0040] The product D H D is per Equation 9. 



Equation 1 1 



LfD=N s I i 



Equation 12 



Ikh is a KN S by KN S identity matrix. The block circulant matrix R,. is multiplied by the 
D matrix, such as per Equation 13. 
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Equation 13 

Each entry of ^ D is a K by K block. A block-diagonal matrix A is per Equation 14. 



fA (1) 



A (2) 



A (3) 



Equation 14 



is of size (KN S ) by (KN S ). Each entry A (0 of A is per Equation 15. 
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Equation 15 



A^ is a K by K block and has K 2 non-zero entries. 

[0041] The D matrix is multiplied to the A matrix, such as per Equation 16. 
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Equation 16 



Each entry of D as shown in Equation 16 is a K by K block. 

[0042] The system of equations produced by equating each row of RJ) with each 
row of D is consistent. Accordingly, the same set of equations result by equating 

any row of RJ) with the same row of D .To illustrate for Equation 13, the first row- 
block of RJ) is per Equation 17. 
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[0043] The first row-block of D is per Equation 18. 



Equation 17 



[A (1) ,A (2) e* ,A<V' 9 /i**e * ,A ( "<>e * ] 



Equation 18 



[0044] Equating the entries of these two rows, Equations 19 and 20 result. 



A (I > = (/?o + R* + R? + R l + 7^) 



Equation 19 
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A 1 'e 



( J2il Jin j&l Me 
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A^ „// v w v 



[0045] As a result, A^ is per Equation 21 . 



Equation 20 



J7x JA/r -J4jt -J2jt 



Equation 21 



[0046] Similarly, A K_I) is per Equation 22. 
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j7jN f -2)n JW^x -j4(jV f -2);r Z jWrZ)x 

A^^^+Rfe N < +R 2 H e N ' +R 2 e' N * +R } e ) 

Equation 22 

[0047] A W is per Equation 23. 

JW^)* JW-V* zJMd^L 

A w) =(i?o+^ +Rfe N < +Rfi K +R x e * ) 

Equation 23 

Although Equations 17-23 illustrate using the first row of I^D and DA , any row can 
be used to determine the A^j . 

[0048] To illustrate using a center row, (Ng/2)* row (row 5 of Equation 7), A^ is 
per Equation 19. 

A (1) = fa + R X H + fl 2 " + /?, + # 2 ) Equation 19 

Equations 1 9-23 compute the FFT of K by K blocks. Since these blocks are multiplied 
by scalar exponentials, this process is referred to as a "block FFT." Typical 
approaches for calculating FFTs, such as the Matlab function "fift" compute the FFTs 

of a one-sided sequence. Since each A^ is a two sided sequence, the computation of A^ 

can be implemented by a fit {QA # * # »^>^>^)»^ r*-iQ,0 and multiplying it by an 
appropriate exponential function as per Equation 27 for a center row. 
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N 

[ceil£±)-l] 
where v = 



Equation 27 



[0049] As shown in Equations 17-27, calculating all the A^ can be performed 
using a single column of R. Accordingly, Rc is not required to be determined. Any 
column of R can be used to derive the A^ s directly. Preferably, a row at least W 
rows from either edge of R is used, since these rows have a full set of R^s. 
[0050] Using the A^ s and the D matrix, the block-circulant matrix R^ can be re- 
written as Equations 28 and 29. 



R C D=DA 



Equation 28 



R c = (l/N S )[DAD H ] 



Equation 29 



D and are each of size (KN S ) by (KN S ). 

[005 1] Since lfD= NJ^LT 1 = (1/N s )lf 9 Equation 30 results. 

R; ] =N s [(DVrA-\Dy)=NA^^) 

N. N. 



Equation 30 



[0052] The MMSE solution is per Equation 3 1 . 
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d = It; l (A H r) 



Equation 31 



The detected data vector d is of size (NgK) by 1. 
[0053] The MMSE solution is per Equation 32. 

D H d = A^Z^^r)] Equation 32 

The matrix is of size (KN S ) by (KN S ) with K by K blocks, and its inverse is per 
Equation 33. 
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Equation 33 



The inversion requires an inversion of the K by K matrices ^ . 
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[0054] As a result, the data vector d is determined per Equation 34. 



[^WL-KTH^L Equation 34 



Equation 34 is applicable to both receivers which sample the received signal at the 
chip rate and at a multiple of the chip rate, such as twice the chip rate. For multiple 
chip rate receivers, the R matrix corresponding to the multiple chip rate is of the same 
form as Equation 9, being approximately block circulant 

[0055] To reduce the complexity in determining F^A H rj , an FFT approach taking 

advantage of the structure of A may be used. A has an approximately block-circulant 
structure. However, it is a non-square matrix, being of size (N S SF) by (NgK). An 
illustration of a matrix A is per Equation 35. 
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Equation 35 
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Each bj (f) is the convolution of the channel response h 00 and the spreading code c w , 

for the k* user at the j" 1 chip interval of the i* symbol interval. 

[0056] Using blocks 5(-) , where each block is indicated by bracketing in Equation 

35, Equation 35 becomes Equation 36. 



A = 



5(0) 0 0 0 

5(1) 5(0) 0 0 
5(2) 5(1) 5(0) 0 



0 
0 
0 



5(2) 5(1) 5(0) 



Equation 36 



[0057] As shown, a portion of A is block-circulant. A circulant extension of A is 
denoted as A,.. 

A can be broken into three matrices per Equation 37. 

A = D l A. i D" Equation 3 7 

D, is a (N.SF) by (N,SF) matrix. D 2 is a (NjK) by (NjK) matrix and A, is a block- 
diagonal matrix of size (N,SF) by (N,K). 

[0058] The block diagonal matrix A has the same form as A of Equation 14. 
However, each of its entries A, W is a SF by K block per Equation 38. 
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A<" = 



(0 



^SFA 



(0 



(0 



3 W 



Equation 38 

[0059] Q is the same form as that of in Equation 11. L{ is of the form of 
Equation 39. 



D, = 



J4rr 
» N < 



SF 



SF 



y!8 n 

e N - 

J20 st 

e N > I 



SF 
SF 



e N * I 

e 1 SF 



SF 



J36 jt 

e 

JAO 

e »• I 



SF 



SF 



j\Z it 
J36 n 



J\62 n 



SF 



SF 



SF 



SF 



I S f is a SF by SF identity matrix. 



Equation 39 
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[0060] In multiplying and D 2 , products of the form, B(i) and e N ' I K 9 are formed 
per Equation 40. 



flx /l&r flQtt /I St /162t /I80r 

[S(0)+5(l)+3(2)] [5(0)e^ +3(2>"' ] ■•• [5(0)e"' +5(2)e * +£(1> * ] 

P£ J4n fXXz /lte /3fcr /180r 

[£(0)+3(l)+£{2)] +£(0)e"' +.S(2> "« )] ••• [£(!>> +fl(0> w ' +5(2)e ] 



44 = 



yi&r jltor /144t yi62r yi8Qr 

[5(0)+jB[0+5(2)] [B(2)e Nt +B(l)e N > +5(0>"' ] - [5(2)e N ' +B®e N ' +5(0> ] 



Equation 40 

4A is of size (N S SF) by (HK) and each block is a size SF by K. 

[0061] In multiplying D, and A, , products of the form, e N ' I SF and Aj W , are 

formed. £|A is of size (N S SF) by (NjK) and each block is of size SF by K. 
Comparing any row of AJ\ with the same row of Z}A,, Equation 41 results. 
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A, (,) = [5(0) +5(1) +5(2)], 

-J2* -JA* 

A, (2) = [5(0) + B(l)e +5(2)e N - ], 

A/"'" 0 = [5(0)+ 5(l)e *• + 5(2)e "« ], 

-J2{N,-\)n -y4(Af,-l)» 

A , (A,,) = [5(0) + B(l)e N - +B(2)e ]. 



Equation 41 



[0062] As a result, each Aj can be determined using the FFT of a one-sided 

sequence of (SF by K) blocks. Using Equation 3 8 and D 2 H D 2 = N s I^, Equations 42, 
43 and 44 result. 



A=£\A l D 2 fi , Equation 42 

A H r = D 2 Af (D, H r) Equation 43 



D?(A H r) = N s -[a? (flfr)] Equation 44 

[0063] Accordingly, [^"c^ is determined using FFTs per Equation 45. 
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Equation 45 



Similarly, since the matrix A is approximately block-circulant, R = A H A + a 2 1 can 
also be computed using FFTs using A } . 

[0064] To reduce complexity, the inversion of each aW, [aW]" 1 , can be 

performed using LU decomposition. Each [a* j is a (K by K) matrix whose LU 
decomposition is per Equation 46. 

A (i) = LU Equation 46 

[0065] L is a lower triangular matrix and U is an upper triangular matrix. Equation 
7 is solved using forward and backward substitution per Equations 47 and 48. 

[ aW = [F(i4* r)] 4 Equation 47 



y^U^l \F(d)\ Equation 48 



[0066] Preferably, to improve the bit error rate (BER) for data symbols at the ends 
of each data fields 22, 24, samples from the midamble portion 20 and guard period 1 8 
are used in the data detection as shown in Figure 5. To collect all the samples of the 
last symbols in the data fields, the samples used to determine r are extended by W-l 

chips (the length of the impulse response) into the midamble 20 or guard period 18. 
This extension allows for substantially all the multipath components of the last data 
symbols of the field to be used for data detection. As shown for data field 1 22, the 
samples are extended into the midamble by W-l chips. The midamble sequences are 
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cancelled from the samples taken from the midamble 20 prior to data detection 
processing. For data field 2 24, the samples are extended into the guard period 1 8 by 
W-l chips. 

[0067] Certain FFT implementations required a certain field length for analysis. 
One of these FFT implementations is a prime factor algorithm (PFA). The PFA 
implementation requires the field length to be a prime number, such as 61. To 
facilitate PFA FFT implementation, the samples used to determine r are preferably 

extended to a desired PFA length. As shown in Figure 5, data field 1 or data field 2 
are extended by P chips to the desired PFA length. Alternately, block FFTs of 61 
symbols are extended to block FFTs of length 64, which requires 2 n FFT calculations. 
Since the approximation of R to a block circulant matrix is reduced, the performance 
typically improves. 

[0068] An analysis of the computational complexity of fast joint detection is as 
follows. The computational complexity of calculating A is K-SF-W. The 
computational complexity of calculating A H A is per Equation 49. 



(k 2 + k) (k 2 -k) 

l—l[2(SF + W- !)-(/;_ - l)]-^r*--- — ~ — '-{SF+W- 1) 



where n max = nan[N s ,{SF + W- l)/SF)+l) Equation 49 



[0069] Calculating (^"cj A as a matrix-vector multiplication has a computational 
complexity of K N, (SF + W-l). Calculating the FFT of a j* column-block of R 

requires K 2 - (N g \o^N t ) calculations. Calculating the Fourier transform of A H r 
requires K( N, log2 N, ) calculations. The inversion of each matrix [A without 
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Cholesky decomposition, requires K 3 calculations. For N s frequency points, the total 
number of calculations is N s K 3 . Calculating [F(d)] k = [a^J ^F^r)] requires 

(K 2 ) multiplications for N s frequency points. Accordingly, the total number of 
calculations is N s K 2 . The Inverse FFT of [F(d)] requires K( N s log2 N s ) calculations. 
[0070] To illustrate the complexity for fast joint detection, the million real 
operations per second (MROPs) for processing a TDD Burst Type I with N c = 976, SF 
- 16, K = 8, N s = 61 and W = 57 chips is determined. The calculations A, (A H A), a 

column-block ofR, are performed once per burst, i.e. 100 times per second. 

The calculations A* 1 !, F [A^ ], computing [^(^ and the inverse FFT of [F(d)] are 

performed twice per burst, i.e. 200 times per second. Four calculations are required 
to convert a complex operation into a real operation. The results are illustrated in 
Table 1. 



Functions executed once per burst MROPS 

Calculating A 3.0 

Calculating A H A 4.4 

Calculating F([R]j] 9.2614 

Calculating 12.4928 
Functions executed twice per burst 

Calculating A H r 28.11 

Calculating F[A H r] 2 J 154 

Calculating [Ftf)\ H^lW^l 3.1232 
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Inverse FFT of [F(d A )] 



2.3154 



Total number of MROPS required for fast joint detection 65.0182 

Table 1 

Note: in Table 1, (A^) was calculated directly as a matrix-vector multiplication. 



to 54.8678 MROPS. If FFTs are used to determine [A H rj, the complexity reduces 

from 65.0182 MROPS to 63.9928 MROPS. 

[0072] A comparison of the complexity of fast joint detection and other detection 
techniques is as follows. The complexity of the following three techniques for a TDD 
Burst Type I with SF = 16 and K = 8 is per Table 2. 

Technique MROPS 

Approximate Cholesky based Joint Detection, (JDChol) 82.7 

Single User Detection: Approximate Cholesky based 
Equalization followed by a Hadamard Transform based 
Despreading (SDChol) 205.2276 

Fast Joint Detection (JDFFT) 65.0 1 82 



[007 1] If LU decomposition is used to determine 



, the complexity reduces 




Table 2 
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[0073] The performance of the three detection techniques and a reference matched 
filtering (MF) data detection technique were compared by simulation over 800 
timeslots. The simulations used the precision provided by Matlab, i.e. no finite 
precision effects were considered. The simulations used channels specified by 
WCDMA TDD WG4; SF =1 6 and K = 8 and 12 and were performed for the downlink 
with no transmit diversity to facilitate comparison to SUD. 

[0074] As shown in Figures 6 and 7, respectively, for cases 1 and 3, the 
performance of fast joint detection, JDFFT, is very close to that of the Cholesky based 
Joint Detection, JDChol. The other data detection approaches did not perform as well 
as JDChol or JDFFT. For the tddWg4Case2 channel as shown in Figure 8. JDFFT 
shows some degradation compared to JDChol. It also performs similarly to the SUD 
based Cholesky algorithm, SDChol. For a high data rate service, such as a 2 Mbps 
service, as shown in Figures 9-11. JDFFT performs close to or slightly worse than 
JDChol and better than the other approaches. 

* * * 
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CLAIMS 

What is claimed is: 

1 . A method for detecting data from K data signals transmitted over a shared 
spectrum in a code division multiple access communication system, the method 
comprising: 

receiving and sampling a combined signal having the K transmitted data signals 
over the shared spectrum; 

producing a combined channel response matrix using codes and impulse responses 
of the K data signals; 

determining a block column of a cross correlation matrix using the combined 
channel response matrix, each block entry of the block column being a K by K matrix; 

talcing a fourier transform of a complex conjugate transpose of the combined 
channel response matrix multiplied to the combined signal samples; multiplying an 
inverse of a fourier transform of each block entry to a result of the fourier transform 
to produce a fourier transform of a data vector; and 

taking an inverse fourier transform of the data vector fourier transform to produce 
data of the K data signals. 

2. The method of claim 1 wherein the taking the fourier transform is by 
multiplying the conjugate transpose of the combined channel response matrix to the 
combined signal samples and taking fourier transform of a result of the conjugate 
transpose multiplication. 

3. The method of claim 1 wherein a LU decomposition of the block entries is 
used to determine the data. 
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4. The method of claim 1 wherein the data determining occurs over a data field 
time period of a time division duplex communication burst and the combined signal 
samples extend beyond the data field time period. 

5. The method of claim 4 wherein extended samples of the combined signal 
samples extends beyond the data field time period for a length of the impulse response. 

6. The method of claim 4 wherein the combined signal samples extends beyond 
the data field time period so that a length of the combined signals is a length 
compatible with a prime factor algorithm fast fourier transform. 

7. A receiver for use in a code division multiple access communication system, 
a transmitter transmits K data signals over a shared spectrum, the receiver comprising: 

means for receiving and sampling a combined signal having the K transmitted data 
signals over the shared spectrum; 

means for producing a combined channel response matrix using codes and impulse 
responses of the K data signals; 

means for determining a block column of a cross correlation matrix using the 
combined channel response matrix, each block entry of the block column being a K by 
K matrix; 

means for taking a fourier transform of a complex conjugate transpose of the 
combined channel response matrix multiplied to the combined signal samples; 

means for multiplying an inverse of a fourier transform of each block entry to a 
result of the fourier transform to produce a fourier transform of the data vector; and 

means for taking an inverse fourier transform of the data vector fourier transform 
to produce data of the K data signals. 
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8. The receiver of claim 7 wherein the talcing the fourier transform is by 
multiplying the conjugate transpose of the combined channel response matrix to the 
combined signal samples and taking fourier transform of a result of the conjugate 
transpose multiplication. 

9. The receiver of claim 7 wherein a Cholesky decomposition of the block 
entries of the diagonal matrix is used to determine the data. 

10. The receiver of claim 7 wherein the data determining occurs over a data 
field time period of a time division duplex communication burst and the combined 
signal samples extend beyond the data field time period. 

1 1 . The receiver of claim 1 0 wherein extended samples of the combined signal 
samples extends beyond the data field time period for a length of the impulse response. 

12. The receiver of claim 10 wherein the combined signal samples extends 
beyond the data field time period so that a length of the combined signals is a length 
compatible with a prime factor algorithm fast fourier transform. 

13. A receiver for use in a code division multiple access communication system, 
a transmitter transmits K data signals over a shared spectrum, the receiver comprising: 

an antenna for receiving a combined signal having the K transmitted data signals 
over the shared spectrum; 

a sampling device for sampling the combined signal; 

a channel estimator for estimating impulse responses of the K data signals; and 
a data detection device for producing a combined channel response matrix using 
codes and the impulse responses of the K data signals; for determining a block column 
of a cross correlation matrix using the combined channel response matrix, each block 
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entry of the block column being a K by K matrix; for taking a fourier transform of a 
complex conjugate transpose of the combined channel response matrix multiplied to 
the combined signal samples; multiplying an inverse of a fourier transform of each 
block entry to a result of the fourier transform to produce a fourier transform of a data 
vector; and for taking an inverse fourier transform of the data vector fourier transform 
to produce data of the K data signals. 

14. The receiver of claim 13 wherein the taking the fourier transform is by 
multiplying the conjugate transpose of the combined channel response matrix to the 
combined signal samples and taking fourier transform of a result of the conjugate 
transpose multiplication. 

15. The receiver of claim 13 wherein a Cholesky decomposition of the block 
entries of the diagonal matrix is used to determine the data. 

16. The receiver of claim 13 wherein the data determining occurs over a data 
field time period of a time division duplex communication burst and the combined 
signal samples extend beyond the data field time period. 

1 7. The receiver of claim 1 6 wherein extended samples of the combined signal 
samples extends beyond the data field time period for a length of the impulse response. 

18. The receiver of claim 16 wherein the combined signal samples extends 
beyond the data field time period so that a length of the combined signals is a length 
compatible with a prime factor algorithm fast fourier transform. 
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Fig. 4 



DETERMINE A COMBINED CHANNEL RESPONSE MATRIX, A, 
USING THE ESTIMATED RESPONSES, fc (k) , AND THE 
SPREADING CODES, c«, FOR EACH BURST 



FORM THE COMBINED CHANNEL CORRELATION MATRIX R = A A 



A K BY K MATRIX A w , IS DETERMINED BY TAKING AN 
FFT OF BLOCK ENTRIES OF A BLOCK COLUMN OF R, 
PREFERRABLY, A CENTRAL COLUMN 



A HERMETIAN OF THE COMBINED CHANNEL RESPONSE 
MATRIX A H , IS MULTIPLIED TO THE RECEIVED VECTOR, 
l , AND A FFT OF A RESULT OF THE MULTIPLICATION IS 
TAKEN AT EACH FREQUENCY POINT; ALTERNATELY, 
[F(A H t)] k CAN BE FORMED BY TAKING AN FFT OF A 
BLOCK COLUMN OF A AND MULTIPLYING IT WITH 
AN FFT OF i . 



PERFORM LU DECOMPOSITION ON A W , A ( 1 = LU, WHERE 
L IS A LOWER TRIANGULAR MATRIX AND U IS AN UPPER 
TRIANGULAR MATRIX IS 1 



PERFORM FORWARD SUBSTITUTION, L y = [F (A t )h 



PERFORM BACKWARD SUBSTITUTION TO SOLVE 
FOR[FU)h,UMFU)U = J 



AN INVERSE FFT OF[F (i)]k IS TAKEN TO DETERMINE 
THE OATA VECTOR 
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tddWg4Case1 channel, SF = 16, K = 8 
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tddWg4Case2 channel, SF = 1 6, K = 8 
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tddWg4Case1 channel, SF=16, K = 12, High data rate service 
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